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Abstract 

For a nonnegative matrix A and real diagonal matrix D, two known in- 
equalities on the spectral radius, r(A 2 D 2 ) > r(AD) 2 and r(A)r(AD 2 ) > 
r(AD) 2 , leave open the question of what determines the order of r(A D ) 
with respect to r(A)r(AD 2 ). Here, sufficient conditions are found on A 
that determine orders in either direction. For a symmetrizable nonnega- 
tive matrix A with all positive eigenvalues and nonnegative D, r(A 2 D) < 
r(A) r(AD). The reverse holds if all of the eigenvalues of A are negative 
besides the Perron root. This is a particular case of the more general 
result that r(A[(l — m) r(B) I + mB]D) is monotonic in m when all non- 
Perron eigenvalues have the same sign — decreasing for positive signs 
and increasing for negative signs, for symmetrizable nonnegative A and 
B that commute. Commuting matrices include the Kronecker products 
A,B £ {®^iM'*}, ti G {0,1,2,...}, which comprise a class of appli- 
cation for these results. This machinery allows analysis of the sign of 
d/drrij r({®f =1 [(1— rrii) r(Ai)Ii + miA;]}D). The eigenvalue sign condi- 
tions also provide lower or upper bounds to the harmonic mean of the 
expected sojourn times of Markov chains. These inequalities appear in 
the asymptotic growth rates of viral quasispecies, models for the evolu- 
tion of dispersal in random environments, and the evolution of site-specific 
mutation rates over the entire genome. 
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1 Introduction 

In a recent paper, Cohen [9] compares two inequalities on the spectral radii, r, 
of products involving nonnegative square matrix A and positive diagonal matrix 
D: 

r(A 2 D 2 ) > r(AD) 2 , (1) 
r(A)r(AD 2 ) > r(AD) 2 . (2) 

Inequality (JXJ) is obtained in [10], while inequality ([2]) is obtained in [9 . The 
relationship between two the left-hand side expressions in the inequalities is not 
determined. Cohen notes that positive matrices A and real diagonal matrices 
D can be chosen to give either 

r(A 2 D 2 ) > r(A)r(AD 2 ) or r(A 2 D 2 ) < r(A)r(AD 2 ), (3) 

and asks whether conditions may be found that guarantee a direction to the 
inequality. 

This seemingly narrow question is intimately related to a very broad class of 
problems that arise repeatedly in ecological and evolutionary dynamics [3J [5J 3] . 
Specifically, it relates to the open question of what conditions on A, B and D 
determine the sign of dr([(l — m)A + raB]D)/dm. 

Two results from the population biology literature provide conditions that 
determine the direction of inequality in ([3j when A is a stochastic matrix. Both 
results rely on the constraint that the stochastic matrix be symmetrizable: 

Definition 1 (Symmetrizable Matrix). A square matrix A is called symmetriz- 
able to a symmetric matrix S if it can be represented as a product A = D|_SDr , 
where D|_ and Dr are positive diagonal matrices. 

Conditions for the inequality r(A 2 D) < r(A) r(AD) are found in the fol- 
lowing theorem of the late Sam Karlin. 

Theorem 2 (OH Theorem 5.1, pp. 114-116, 197-198]). Consider a family T 
of stochastic matrices that commute and are simultaneously symmetrizable to 
positive definite matrices, i.e.: 

T:={Mi=T> L SiT>R:M h M k =M k M h }, (4) 

where D|_ and Dr are positive diagonal matrices, and each is a positive 
definite symmetric nonnegative matrix. Let D be a positive diagonal matrix. 
Then for each M h ,M k e T: r(M h M fc D) < r(M fe D). 

The theorem applies to (|3j by constraining D 2 to be a positive diagonal 
matrix and substituting = = A, and r(A) = 1, which yields r(A 2 D 2 ) < 
r(A)r(AD 2 ). 

Theorem [5] is extended in [3] to conditions that make the spectral radius 
monotonic over a homotopy from to M/jMfe. This monotonicity, either in- 
creasing or decreasing, establishes inequalities in each direction between r(A 2 D) 
and r(A)r(AD). 



Spectral Radius Product r(A)r(AD) Versus r(A 2 D) 



3 



Theorem 3 (From [31 Theorem 33]). Let P and Q be transition matrices of 
reversible ergodic Markov chains that commute with each other. Let D ^ c I be 
a positive diagonal matrix, and define 

M(m):=P[(l-m)I + mQ], me [0,1]. (5) 

If all eigenvalues o/P are positive, then dr(M(m)D)/dm < 0. If all eigenvalues 
o/P other than Ai(P) = 1 are negative, then dr(M(m)D)/dm > 0. 

The condition in Theorem [5] that the matrices be symmetrizable is shown in 
[2 Lemma 2] to be equivalent to their being the transition matrices of reversible 
Markov chains. Theorem |3j yields Theorem [2] by letting = P and = Q. 
Then M(0) = M fc and M(l) = M h M k = M k M h . The hypothesis that M fc 
is symmetrizable to a positive definite matrix means that has all positive 
eigenvalues, so dr(M(m)D)/dTO < for any positive diagonal D, and thus 
r(M h M k D) = r(M fe M, l D) < r(M fe D). Note that the eigenvalues of M h arc 
generally irrelevant to this inequality. 

For the inequality in the reverse direction, r(A 2 D) > r(A)r(AD), let all 
the eigenvalues of A other than r(A) = Ai(A) = 1 be negative and substitute 
A = P = Q, so M(0) = A and M(l) = A 2 . The result dr(M(m)D)/dm > 
yields r(A 2 D) > r(A)r(AD) for such a stochastic symmetrizable matrix A. 

In the present paper, Theorem |3j is generalized to all symmetrizable irre- 
ducible nonnegative matrices. 

2 Results 

The following notational conventions are used. The elements of a matrix A are 
[A]jj = Aij, the columns are [A]j, and the rows are [A]\ A diagonal matrix 
with elements of a vector x along the diagonal is D x = diag [x] . When D ^ cl 
for any c £ K, D is called nonscalar. The vector with 1 in position i and zeros 
elsewhere is ej. 

We review the properties of irreducible nonnegative n x n matrices. When 
A is irreducible then for each there is some t £ N such that [A*]y > 0. 
The eigenvalues of A are represented as A, (A), i = 1, . . . ,n, and the spectral 
radius by r(A) := nm^i,.. „ |A»|. We recall from Perron-Frobenius theory 
that r(A) is a simple eigenvalue of A, called the Perron root, designated here as 
r a = r(A) = Ai (A). The non-Perron eigenvalues are \ai = Aj(A), i = 2, . . . , n. 
Let v(A) and u(A) T be the right and left Perron vectors of A, the eigenvectors 
associated with the Perron root, normalized so that e T v(A) = u(A) T v(A) = 1, 
where e is the vector of ones. Since A is irreducible, from Perron-Frobenius 
theory, v(A) and u(A) T are strictly positive and unique. 

The goal is to provide conditions that make the spectral radius of A[(l — 
77i ) rs I + rnB]D monotonic in m. We proceed as follows: first, A and B are 
constrained to commute and be symmetrizable, which allows them to be simul- 
taneously represented by the canonical form (|6]); second, this form is used to 
show that its spectral radius can be represented as a sum of squares; finally, the 
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derivative of the spectral radius is represented as a sum of squares, and this is 
utilized to give conditions that determine its sign. 

The following representation of symmetrizable matrices is used throughout. 
It arises for the special case of transition matrices of reversible Markov chains 
US p. 33]: 

Lemma 4 (Canonical Form for Symmetrizable Matrices [2] Lemma 1]). A 
symmetrizable matrix A = DlSDr, where S is symmetric and D|_ and Dr are 
positive diagonal matrices, can always be put into a canonical form 

A = D L SD R = EKAK T E \ (6) 

where E = Dl /2 D r 1/2 is a positive diagonal matrix, K is an orthogonal matrix, 
K T is its transpose, A is a diagonal matrix of the eigenvalues of A, the columns 
o/EK are right eigenvectors of A, and the rows o/K T E _1 are left eigenvectors 
of A. 

Proof. First it is shown that for arbitrary symmetric S there is a symmetric S 
such that A = D L SD R = ESE -1 . Direct substitution of E = Dl /2 D r 1/2 gives 

D l SDr = Dl /2 Dr 1/2 SDl 1/2 D r /2 

S = D- 1/2 D R /2 D L SD R D^ /2 D R 1/2 = Di /2 D R /2 SDi /2 D R /2 , 

which is symmetric. Symmetric S has a symmetric Jordan canonical form S = 
KAK T where K is an orthogonal matrix and A is a matrix of the eigenvalues of 
S, which by construction are also the eigenvalues of A. Hence A = D|_SDr = 
ESE" 1 = EKAK T E Let [EK], be the ith column of EK. Then 

A[EK], = EKAK T E _1 [EK], = EKAK T [K], = EKAe, = A ! EKe ! = A i; [EK] 4 

hence [EK],; is a right eigenvector of A. The analogous derivation shows the 
rows of K T E _1 to be left eigenvectors of A. □ 

Lemma 5 (Canonical Form for Commuting Symmetrizable A and B). Let A 

and B be n x n symmetrizable irreducible nonnegative matrices that commute 
with each other. Then A and B can be decomposed as 

A = DV^-^KA^D^/X 72 , (7) 
B = Di^D-^KAsKTD-^DV 2 , (8) 

where v = v(A) = v(B), u = u(A) = u(B), K is an orthogonal matrix, 

[K]i =Cv /2 Du /2 e, and Aa and A b are diagonal matrices of the eigenvalues of 
A and B, respectively. 

Proof. Since A and B are symmetrizable, each can be represented by canonical 
form ([6]). A and B are diagonalizable since A in ([6]) is a diagonal matrix 
and (EK) -1 = K T E _1 . Since A and B commute by hypothesis, they can be 
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simultaneously diagonalized [H] Theorem 1.3.19, p. 52], which means there 
exists an invertible X such that A = XA^X -1 and B = XAgX -1 . Clearly 
the columns of X are right eigenvectors of A and B, and the rows of X -1 are 
left eigenvectors of A and B, since 

A[X] 4 = XA^X- x [X]i = XA A e s - XA 4 (A)e, = Ai(A)[X]<, 

etc.. We can set X = EK to give 

A = XAaX" 1 = EKAa(EK)" 1 = EKA A K T E^ 
B = XAsX" 1 = EKAb(EK)" 1 = EKAflK T E _1 . 

Without loss of generality, the Perron root is given index 1, so r(A) = Ai(A), 
r(B) = Ai(B). Since A and B are irreducible, 

v(A) = [EK]i = v(B) = v > 0, (9) 
u(A) T = [J^E" 1 ] 1 = u(B) T e u T > 0. (10) 

Next, E is solved in terms of u T and v: 

[EK]! = E[K]i = v, so [K]i = E _1 v, 

and 

[I^E" 1 ] 1 = [K T ] 1 E~ 1 = u T so [tC] 1 = u T E, 

which combined give K\j — EJ Vj = UjEj, hence Ej — vj/uj so Ej — \/vj/uj 
and 

E = D^D,: 1 / 2 . (11) 
The first column of K evaluates to 

[K] x = E-V = D- 1 /2 D y 2 v = Dy 2 Dy 2 e. (12) 

□ 

Theorem 6 (Sum-of-Squares Solution for the Spectral Radius). 
Let A and B be nx n symmetrizable irreducible nonnegative matrices that com- 
mute. Let rA = r(A.) = Xai and rB = r(H) — Xbi refer to their Perron roots, 
and {\Ai} and {Xbi} represent all of the eigenvalues of A and B, respectively. 
Let u T and v be the common left and right Perron vectors of A and B (Lemma 
[5|). Let D be a positive diagonal matrix and define 

M(m) := A[(l -m)r B I + mB], me [0,1]. 

Let v(m) = v(M(m)D) and u(m) T = u(M(m)D) T refer to the right and left 
Perron vectors o/M(m)D. 
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Then 



where 



r(M(m)D) = J2 X Ai [{l-m)r B + m\ Bi ) yi {m) 2 , (13) 



(itid K is /rom i/ie canonical form in Lemma\^ 

Proof. One can represent M(m) using the canonical forms (O, ((SJ: 

M(m) = Dy 2 D u 1 / 2 KA A [(l-m) r B I + mAu^D"^}/ 2 . (15) 

This form will be used to produce a symmetric matrix similar to M(m)D, which 
allows use of the Rayleigh-Ritz variational formula for the spectral radius. The 
expression will be seen to simplify to the sum of squared terms. 

For brevity let * m := KA^I-to^bI + mA B ]K T , so M(m) = E* m E _1 
where E = ' 2 Du^ J from (fTTj). Since M(m) > and E is a positive diagonal 
matrix, then <& m > 0. The following identities are obtained by multiplication 
on both sides by E, D 1 / 2 , and their inverses: 

r(M(m)D) = r(E<I> m E _1 D) = r(f m E _1 DE) = r(* m D) = r(D 1 / 2 $ m D 1 / 2 ) 
= r(S m ), 

where 

S m := D 1 / 2 *,^ 1 / 2 = D 1 / 2 KA A [(1 - m)r B I + mAs^D 1 / 2 (16) 
= D 1/2 E- 1 M(rn)ED 1/2 (17) 

is symmetric, so we may apply the Rayleigh-Ritz variational formula for the 
spectral radius [HI Theorem 4.2.2, p. 176]: 

r(S m ) = max * ^ mX . (18) 

This yields 



r(M(m)D) = r(S m ) = max x S m x 

X T X— 1 



= max x T D 1/2 KA j4 [(l - m)r B l + ?tiA b ]K t D 1/2 x. (19) 

X T X=1 

Any x that yields the maximum in (|19p is an eigenvector of S m [131 p. 33]. Be- 
cause M(m) is irreducible and D and E are positive diagonal matrices, M(m)D 
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and S m = D 1 / 2 E _1 M(m)ED 1 / 2 are irreducible. By Perron-Frobenius theory, 
x(m) > is therefore the unique left and right Perron vector of S m . This allows 
one to write 

r(M(m)D) = x(to) t D 1/2 KA a [(1 - m)r B l + mA B ]K T D 1/2 x(m). (20) 
Define 

y(m) := K T D 1/2 x(m). (21) 

Substitution of ([21]) into <(20]) yields ([T3j) : 

r(M(m)D) = i{mfT> 1/2 KA A [{l-m)r B l + toA b ]K t D 1/2 x(m) 
= y(ni) T A A [(l-m)r B l + mA B ] y(m) 

n 

= ^ ^Ai[{^-rn)r B + m\ Bi ] yi(m) 2 . 

i=l 

Next, y(m) will be solved in terms of v(m) and u(m) by solving for x(m), using 
the following two facts. For brevity, define A m :~ A^[(l — m)r B l + mA B ], so 
M(to) = EKA m K T E _1 : 

1. r(M(m)D) v(m) = M(ro)Dv(m) = EKA m K T E 1 D v(m); (22) 

2. r(M(m)D) x(m) = D 1/2 KA m K T D 1/2 x(m). (23) 

Multiplication on the left by ED -1 / 2 on both sides of (|2"3"|) reveals the right 
Perron vector v(m) = v(M(m)D): 

r(M(m)D) (ED~ 1/2 )x(m) = (ED~ 1/2 )D 1/2 KA m K T D 1/2 x(m) 
= EKA m K T (E- 1 DED- 1 )D 1/2 x(m) 
= (EKA m K T E- 1 D)(ED- 1/2 x(m)) 

= M(m)D(ED- 1/2 x(m)), (24) 

which shows that ED _1 / 2 x(m) is the right Perron vector of M(m)D, unique 
up to scaling, i.e. 

v(m) = c(m) ED~ 1/2 x(m) = c(m) Di /2 D u 1/2 D" 1/2 x(m), 

for some c(m) to be solved, which gives 

*(m) = -i-D- 1/2 Dy 2 D 1/2 v(m). (25) 
c(m) 

The normalization constraint x(m) T x(m) = 1 yields 

1 = x(t7j) t x(to) = — — -vfmfD^DuDvfm), 
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so 



c(m) = yvM T D- 1 D u Dv(m). (26) 
Substitution for x(m) now produces (| 14|) : 

y(m) := K T D 1//2 x(m) = K^D^-i-D^B^D 1 / 2 v(m) 

c(m) 

1 _ K T D- 1 / 2 Dy 2 Dv(m). (27) 



/v(m) T D7 1 D u Dv(m) 
Each element of y(m) is thus 



hence 



Vl{m) = ; n. = E K * ~2 • 



„,„,r = I [ Sp K , D 3 U ) ,2v M ) 

[ ' V- , ,2DjU 3 2^ A ^ 1/2 



□ 

Theorem 7 (Main Result). Let A and B be n x n symmetrizable irreducible 
nonnegative matrices that commute with each other, with Perron roots rA = 
r(A) = Xai and rs = r(B) = Xbi, and common left and right Perron vectors, 
u T and v. Let D fee a nonscalar positive diagonal matrix, and suppose 

M(m) := A[(l-m)r B I + mB], me [0,1]. (28) 

CI. Lf all eigenvalues of A are positive, then 

-^-r(M(m)D) < 0. (29) 
dm 

C2. Lf all eigenvalues of A other than r A — Ai(A) are negative, then 

-^-r(M(m)D) > 0. (30) 
dm 

C3. IfX Al = for all i € {2, . . . , n}, then dr(M(m)D)/dm = 0. 

C4- Lf CI or C2 hold except for some i £ {2, ...,n} where Xai — 0, then 
inequalities (1291) and (|30[) are replaced by non-strict inequalities. 
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Proof. The sum-of-squares form in Theorem [5] is now utilized to analyze the 
derivative of the spectral radius. For an arbitrary irreducible nonnegative matrix 
F(m) that is a differentiable function of m, the derivative of its spectral radius 
follows the general relation [H Sec. 9.1.1]: 

A r (F(m)) = u(F(m)) T dF ^ v(F(m)). (31) 
dm dm 

The derivatives of u(F(m)) and v(F(m)) do not appear in (|3"Tj) because they 
are critical points with respect to r(F(m)). Application of (j3"T1) gives 

Ar(MHD) = x(m) T D'/ 2 K A<1 d[(1 = + k^D 1 / 2 x(m) 

dm dm 

= x(m) T D 1/2 KA A (A B - r B I)K T D 1/2 x(m). 
Substitution with y(m) := K T D 1 / 2 x(m) yields: 

- d -r(M(m)D) = y(m) T A A (A B - r B I)y(m) 
dm 

n 

= ^2^Ai(^Bi - r B )yi(m) 2 . (32) 
i=i 

We know the following about the terms in the sum in (|32|) : 

1. Xgi — rg = 0. Thus the first term i = 1 of the sum is zero. 

2. For i G {2,...,n}, A^ — r B < 0, hence (\b% — rB)yf < 0. Since B 
is symmetrizable, A^i S K. Since B is irreducible the Perron root has 
multiplicity 1, and \\bi\ < tb BCD Theorems 1.1, 1.5]. Together these 
imply \ Bl < r B for i <E {2, . . . , n}. 

3. yi{m) 7^ /or at least one i £ {2, . . . , n} 7 whenever D ^ cl /or any c > 0. 
Suppose to the contrary that yi(m) = for all i G {2, . . . , n}. That means 
y(m) = yi(m) e x so (|2"T|) becomes 

y(m) = yi(m)ei = c(m)- 1 K T D^ 1 ^ 2 Dy 2 D v. 

^ 1*2 1/2 

Now multiply on the left by nonsingular D v D u K and substitute 
[K]i =Dy 2 Di /2 e (51]): 

yi (m) (Dy 2 D u 1 / 2 K)e 1 = Vl (m) D^D- 1 /^ 

= yi (m)Dy 2 D u 1 / 2 D y 2 Dy 2 e 

= y% (m)v 

= c(m)- 1 (Dy 2 D u 1 / 2 K)K T D- 1 / 2 Dy 2 Dv 
= c(m) _1 D v. 

Hence v = (yi(m)/c(m))Dv > 0, which implies D = (c(m)/yi(m))I, 
contrary to hypothesis that D ^ cl for any c > 0. Thus D being nonscalar 
implies that yi{m) ^ for at least one i £ {2, . . . , n}. 
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Combining points[3], and[31 above, we have (X B i — rB)yi(m) 2 < for at least 
one i S {2, . . . , n}, while from point [T], Aai(Abi — r B )yi(m) 2 = 0. Thus, if the 
signs of A^i, i = 2, . . . , n are the same, the nonzero terms in the sum in (|32p 
all have the same, opposite sign, and there is at least one such nonzero term. 
Therefore, 

1. if \ai > for all i, then \Ai{^Bi — r B ) < Vi > 2, thus 

— r(M(m)D) = ^A Ai (A B i - r B )yf(m) < 0; 

i=2 

2. if \ai < for i = 2, . . . , n, then A^Ab, — r B ) > Vi > 2, thus 

d " 
— r(M(m)D) = ^ A A i(A B i - r B )y 2 (m) > 0; 

i=2 

3. If \m = for all i G {2, . . . , n}, then all the terms in (|32l) are zero so 
dr(M(m)D)/dm = 0. 

4. If Axi = for some i € {2,...,n}, we cannot exclude the possibil- 
ity that yiijn) happens to be the one necessary nonzero value among 
y2(7n), . . . ,y n (m), while yj(m) = for all j ^ in which case all the 
terms in (|3"2"j) would be zero. Thus inequalities in (|29p and (|30l) become 
non-strict if any Xai = 0. 

If the non- Perron eigenvalues of A are a mix of positive, negative, or zero values, 
there may be positive, negative, or zero terms XAi{X B i — r B)y1 for i = 2, . . . , n, 
so the sign of dr(M(m)D)/dm depends on the particular magnitudes of Aa«, 
Asi, rs, and ?/i(m). 

□ 

Corollary 8. For the case n — 2 of Theorem^ dr(M(m)D)/dm ftas i/ie 
opposite sign of Xa2 and is zero if Xa2 — 0. 

Proof. When n = 2, A has only one eigenvalue besides the Perron root. There- 
fore D^cl implies y 2 (m) ^ in ([3"2"J), so (A B2 -r B )y\ < 0. Thus 

-^-r(M(m)D) = Aai(A B i - r B )y\ + A A2 (A B 2 - r B )y\ 
am 

= r A *0*yj + X A 2(X B 2 - r B )y\. 

Therefore dr(M(m)D)/dm has the opposite sign of Xa2, or is if Xa 2 = 0. □ 

The following is immediate: 

Corollary 9. In Theorem [?J the term r B I in (|28p may be replaced by any 
symmetrizable nonnegative matrix C that commutes with A and B for which 
r(C) = r(B) and Aci > X B i, i = 2, . . . , n. 
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Note that the indices i E {2, . . . , n} are not ordered here by the size of the 
eigenvalues as is commonly done, but are set by the arbitrary indexing of the 
non-Perron eigenvectors. 

Corollary 10 (Partial answer to Cohen's open question). Let A be an n x n 

symmetrizable nonnegative matrix and D be a positive diagonal matrix. If all 
of the eigenvalues of A are positive, then r(A)r(AD) > r(A 2 D). // all of the 
non-Perron eigenvalues of A are negative, then r(A) r(AD) < r(A 2 D). When 
A is irreducible and D is nonscalar, then the above inequalities are strict. If all 
of the non-Perron eigenvalues of A are zero, then r(A) r(AD) = r(A 2 D). 

Proof. Let A be irreducible and D nonscalar. Apply Theorem [7] with A = B. 
Then M(0) = r(A)A and M(l) = A 2 . If all the eigenvalues of A are positive, 
then by Theorem dr(M(m)D)/dm < 0, so r(M(Q)D) = r(A)r(AD) > 
r(A 2 D) = r(M(l)D). If all the non- Perron eigenvalues of A are negative, then 
dr(M(m)D)/dm > 0, so r(M(0)D) = r(A)r(AD) < r(A 2 D) = r(M(l)D). If 
\ Ai = for i e {2, . . . , n} then dr(M(m)D) /dm = so r(A)r(AD) = r(A 2 D). 

If D = cl for c > then r(A) r(AD) = cr(A) 2 = cr(A 2 ) so equality 
holds. A reducible symmetrizable nonnegative matrix A is always the limit of 
some sequence of symmetrizable irreducible nonnegative matrices, for which the 
eigenvalues remain on the real line. If \ai, i = 2, . . . , n are all negative or all 
positive, then they continue to be so for these perturbations of A by the conti- 
nuity of the eigenvalues. For each perturbation, the sign of dr(M(m)D)/dm is 
maintained, but in the limit equality cannot be excluded, so only the non-strict 
versions of the inequalities are assured for reducible matrices. □ 

Theorem 11 (Main Result Variant). Let A and B be n x n symmetrizable ir- 
reducible nonnegative matrices that commute with each other, with equal Perron 
roots r{A) — r(B) = A^i = Abi, and common left and right Perron vectors, 
and v. Let Aai and Xsi, i = 2, . . . ,n be the non-Perron eigenvalues. Let D be 
a nonscalar positive diagonal matrix, and suppose 



1. If Xai > X B i for i = 2,...n, then dr(M(m)D)/dm < and r(AD) > 
r(BD). 

2. If Xai < Xbi for i = 2,...n, then dr(M(m)D)/dm > and r(AD) < 
r(BD). 

3. If \ A i = A Bi for i = 2,...n, then dr(M(m)D)/dm = and r(AD) = 
r(BD). 

4- If Xai — Asi for at least one i G {2, . . . , n}, then the inequalities m[7] and 
[H are replaced by non-strict inequalities. 

Proof. The proof follows that of Theorem [7] with some substitutions. M(m) has 
the canonical representation 



M(m) 



(1 



m)A + mB, 



m e [0,1]. 



(33) 



M(m) := (l-m)A + mB = Dy 2 D" 1/2 K[(l-m) A A + mA B ]K T D- 1/2 Dy 2 . 
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The spectral radius has the sum-of-squares form as developed in (fToT) - (|2"T|) , 
where x(m) is as given in (|25|) . and the derivative of the spectral radius evaluates 
to 

j-r(M(m)D) = x(m) T D 1 / 2 K d[(1 ~ m) * A + ^' ^^(m) 
dm dm 

= x(m) T D 1/2 K(A B - A A )K T D 1/2 x(m). 
Substitution with y(m) := K T D 1 / 2 x(m) yields: 

j « 

— r(M(m)D) = y(m) T (A B - A A ) y(m) - £ (X Bi - \ Ai ) Vl (rnf . (34) 

i=i 

The relevant facts about are: 

1. Asi — A^ = by construction. Thus the first term i = 1 of the sum is 
zero. 

2. yi(m) ^ for at least one i G {2, . . . , n}, whenever D ^ cl for any c > 0, 
as in Theorem [7] 

If Ayii > \bi for i = 2, . . . , n then all of the terms (A Bi — A^,) yi(m) 2 in (|3~4l 
are nonpositive, and at least one is negative, therefore dr(M(m)D)/dm is nega- 
tive. If \ A i < Asi for i = 2, . . . , n then all of the terms in (|34[) are nonnegative, 
and at least one is positive, therefore dr(M(m)D)/dm is positive. If \a% = As; 
for i = 2, . . . , n all of the terms in (f34| are zero so dr(M(m)D)/dm = 0. 

As in Theorem[7l if Aa; = Asi for some i £ {2, . . . , n}, we cannot exclude the 
possibility that happens to be the necessary nonzero value among y^, ■ ■ ■ , y n 
while yj = for all j ^ 1, i, in which case all of the terms in (|34|) are zero so 
dr(M(m)D)/dm = 0. Thus inequalities in Q] and [2] become non-strict if there is 
a single equality between non- Perron eigenvalues of A and B. □ 

Remark 12. It is notable here that the relation on the Perron root, r(AD) > 
r(BD), is determined by the non-Perron eigenvalues of A dominating those of 
B, for A and B with the same Perron root and nonscalar D. Domination here 
means Xai > ^Bi where i is the index on the non- Perron eigenvectors as ordered 
in K. One may ask whether this relation holds if K is merely invertible rather 
than orthogonal as required for simultaneous symmctrizability of A and B. 



3 Applications 

The inequalities examined here arise naturally in models of population dynam- 
ics. Karlin derived Theorem [5] [T5J Theorem 5.1] in order to analyze the pro- 
tection of genetic diversity in a subdivided population where M is the matrix of 
dispersal probabilities between patches. He wished to establish a partial order- 
ing of stochastic matrices M with respect to their levels of 'mixing' over which 
r(MD) decreases with increased mixing. Another partial ordering of matrices 
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examined by Karlin is of the form M(m) = (1 — to) I + mP with mixing param- 
eter m. Karlin's Theorem 5.2 [15] shows that r(MD) decreases with increasing 
to, strictly when P is an irreducible stochastic matrix and D is a nonscalar 
positive diagonal matrix. 

Variation in (1 — to)I + toP over to represents variation in the incidence 
of a single transforming processes (such as mutation, recombination, or disper- 
sal) that scales all transitions between states equally. However, many natural 
systems have multiple transforming processes that act simultaneously, in which 
case the variation with respect to a single one of these processes generally takes 
the form (1 — to)Q + toP where Q is also a stochastic matrix. Karlin's Theorem 
5.2 does not apply for general Q ^ I. The motivation to develop Theorem [T71 
[2 Theorem 2], below, was to extend Karlin's Theorem 5.2 to processes with 
multiple transforming events. 

An open problem posed in [Tj and [5] is the general characterization of the 
matrices Q, P, and D such that r([(l — to)Q + toP]D) strictly decreases in to. 
Theorem [3] [3 , Theorem 33] goes part way toward this characterization. 

3.1 Temporal Properties 

Theorem was obtained to generalize a model by McNamara and Dall [TB] of a 
population that disperses in a field of sites undergoing random change between 
two environments, where each environment produces its own rate of population 
growth. In the generalization [3], the environments are modeled as a reversible 
Markov chain with transition matrix P, and Q = limj^ooP'. For P to have 
all negative non- Perron eigenvalues means that the environments change almost 
completely from one time increment to the next, while all positive eigenvalues 
correspond to more moderate change. 

The correspondence originally discovered by McNamara and Dall [18] was 
between the sojourn times of the random environmental states, and whether 
natural selection was for or against dispersal. The generalization of their model 
shows that the direction of evolution of dispersal and the sojourn times of the 
environment are both determined by conditions CI and C2 on the signs of the 
non-Perron eigenvalues of the environmental change matrix [3] Theorem 33]. 
More specifically, it is an inequality on the harmonic mean of the expected 
sojourn times of the Markov chain that is determined by conditions CI and C2. 
The inequality derives from a remarkably little-known identity. 

Lemma 13 (Harmonic Mean of Sojourn Times [3] Lemma 32]). For a Markov 
chain with transition matrix P, let Ti(P) be the expected sojourn time in i (the 
mean duration of state i), and let {Ai(P)} be the eigenvalues of P. Let Ea and 
Eh represent the unweighted arithmetic and harmonic means, respectively. 
These are related by the following identities: 



E H (T i (P))(l-E A (*t(P))=l 1 



(35) 
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or equivalently 



I should qualify "little known" — a version of (|35|) is well-known within the 
field of research on social mobility, but no reference to it outside this community 
appears evident. The identity arises in Shorrock's [22] social mobility index 



i " 

M(P) = Y,(i- PiiJ , 

= 1 



where is the probability of transition from social class j to class i. Shorrocks 
notes that M(P) is related to the expected sojourn times ('exit times') for each 
class i, Ti = 1/(1 — Pa), through their harmonic mean, 

E H (n) := ' 



n h T * 



Evaluation gives 
E H 



1-Pii 



n 



yielding 



M(P) = ^-rX>- P ») = 
71—1 T - ? 



1-Pu 



Geweke et al. [12] define another social mobility index, 

-E?=i|Ai(P)| 



M E (P) 



71-1 



They note that when all the eigenvalues of P are real and nonnegative, then 
M(P) = Me(P), by the trace identity J27=i P ™ = E^i M P )- Numerous 
papers cite this correspondence [THl HO] • However no expression of the identity 
in terms of the harmonic and arithmetic means, as in the forms (|35l) or (|36l) . is 
evident. 

Next, the eigenvalue conditions CI and C2 are applied to the identity (|35| . 



Theorem 14 (From [3] Theorem 33]). Let P be the n x n transition matrix of 
an irreducible Markov chain whose eigenvalues are real. Let Tj(P) = 1/(1 — Pa) 
be the expected sojourn times in state i. 
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CI. If all eigenvalues ofP are positive, then 



E H ( Tl (P))>l + -. (37) 

n— 1 



C2. If all non-Perron eigenvalues of P are negative, then 



1 < E H ( n (P)) < 1 + (38) 

C3. If all non-Perron eigenvalues of P are zero, then 

l<E H ( n (P)) = l + ^—. (39) 
n— 1 

C^. // aZ/ non-Perron eigenvalues o/P are i/ie some sign or zero, and at least 
one is nonzero, then inequalities (|37p and ()38[) are unchanged. 

Proof. The following equivalent inequalities are readily derived from one an- 
other: 

£ " (T '^T^T >1 + ^I^ (40) 

i—i 

^==> n-i > — = — = n — Pa 

n n n 

1< ^P l2 = 5>(P) = l + ^A,(P) 

n 

«=>0<$>(P). (41) 

4 = 2 

The analogous equivalence holds if the directions of the inequalities are reversed. 
If Ai(P) > for all % then l(4T)). ([i0|. and ([37| hold. Conversely, if A,(P) < 
for i = 2, . . . , n then X)"=2 ^»(^*) ^ ^, reversing the direction of the inequalities, 
and the right side of ((38]) holds. The left side of (|38|) clearly holds since r 4 (P) = 
1/(1- P«) > 1 for each i. If A*(P) = for i = 2, n then clearly E H (n) = 1 + 
l/(n-l). If Ai(P) > for i e {2, . . .,n}, and A»(P) > for some ie{2,...,n} ) 
then ^™_ 2 Aj(P) > so (14"TT) continues to hold; analogously for the reverse 
inequality. □ 

We have seen that conditions CI and C2 are sufficient to determine oppo- 
site directions of inequality in two very different expressions, one involving the 
temporal behavior of a Markov chain, Ejj(ti(P)) > 1 + l/(ra— 1), and the 
other involving the interaction of the chain with heterogeneous growth rates, 
dr(P[(l - m)I + mQ]D)/dm < and r(PD) > r(P 2 D) (under condition CI; 
the reverse under C2). 
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The inference in these results goes from the eigenvalue sign conditions, CI 
and C2, to the inequalities. The converse, an implication from the inequality 
directions to the eigenvalue sign conditions, is found only in the case n = 2. It 
would be of empirical interest to know if there exist classes of stochastic matrices 
P for n > 3 in which the temporal behavior has direct implications upon the 
spectral radius, i.e. £jj(Tj(P)) tells us about dr(P[(l — m)I + mQ]D)/dra and 
r(P 2 D)/r(PD), or vice versa, without recourse to conditions CI and C2. 

An example of such a class for n > 3 is devised using rank-one matrices. Let 
V n be the set of probability vectors of length n, so e T x = 1, x > for x e V n - 
Define the set of stochastic matrices 

TZ n := { (1 - ar)I + «v e T : v E V n , v > 0, a E (0, min — - — 1 1 . 

Corollary 15. Let P E lZ n , let Q be a symmetrizable irreducible stochastic 
matrix that commutes with P 7 and let D be an n x n nonscalar positive diagonal 
matrix. Then 

d -r(P[(l -m)I + mQ]D) < if and only if E H (Ti(P)) > 1 ' 



dm n— 1 

-^-r(P[(l - m)I + mQ]D) = if and only if E H (Ti(P)) = 1 + , 
dm n— 1 



-^-r(P[(l - m)I + mQ]D) > if and only if 1 < ^(^(P)) < 1 



n-1 



Corollary 16. Lef P 6 TZ n , and let D be an n x n nonscalar positive diagonal 
matrix. Then 

r(P 2 D) < r(PD) j/ and onZy if E H {n(P)) > 1 + ' 



n— 1 ' 



r(P 2 D) = r(PD) if and only if E H (n(P)) = 1 ' 



n— 1 



r(P 2 D) > r(PD) if and on/y i/ 1 < S ff (n(P)) < 1 ' 



n-l 

Proof. The upper bound on a is the largest that assures An = 1 — a + avi > 
for each i. In addition it gives 

1-ae [1-mia- ,l) = T-min— ^— ,l). (42) 

L i 1—Vi ' i 1 — Vi 

Any P € 72.„ is irreducible since by hypothesis v > 0, a > 0. To apply Theorem 
we must verify that P £ 7?. n is symmetrizable: 



P = (1 - a)I + ave T = Dy 2 [(l - a)I + a(Dy 2 ee T Dy 2 )]D- 1 



-1/2 
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Let Zj be a right eigenvector of P G 7Z n associated with Ai(P). Then 
AjZj = Pzj = [(1 — a)I + av e T ]zi = (1 — a)zi + av( e T Zj) 
(Ai — 1 + a)zi = av( e T Zi) 

e T z 4 = 0, Ai = 1 - a or z., = a ( e z ') v _ v ^ _ ^ 

Ai + a — 1 

So either (1) Zi is the right Perron vector of P, or (2) Ai = 1— a G [— miiij ^ , 1), 
by (f42|) . Thus all of the non- Perron eigenvalues of P are equal, and may be either 
positive, zero, or negative in the range [—mini 1 , 0), in which case exactly 
one of conditions CI, C3, or C2 is met, respectively, for Theorems [71 and [T4l and 
Corollary [TU1 with the consequent implications. □ 

3.2 Kronecker Products 

A notable class of matrices that exhibit the commuting property required for 
Theorems [2l [3l [U and[7Jis the Kronecker product of powers of matrices. Define 
a set of square matrices 

C:={A 1 ,A 2 ,...,A i }, 
where each Ai is an rii x rii matrix. Define 

L 

M(t) := (g) A*' = A 4 ! 1 ® A f 2 2 ® • • • ® A% , (43) 

i=l 

where ® is the Kronecker product (a.k.a. tensor product), G N := {0,1,2,...}, 

L 

, ti 



and t G N L . Now define the family of such products: 



^(C) = ^(V)A i :t i e {0,1,2,...} 



, i=l 



Clearly, any two members of ^(C) commute, because for any p,q G N L , then 

L 

M(p) M(q) = (g) Af +9< = M(q) M(p). 

»=i 

Products of the form arise in multivariate Markov chains for which 

each variate Xi constitutes an independent Markov chain with transition matrix 
Ai . The joint Markov process is exemplified be the transmission of information 
in a string of L symbols where transmission errors occur independently for each 
symbol. Such a process includes the genetic transmission of DNA or RNA 
sequences with independent mutations at each site. Under mitosis, the genome 
replicates approximately according to a transition matrix for a string of symbols 
with independent transmission errors at each site i: 

L 

M m := (g) [(1 - rm)Ii + WiPi] = A[(l - m k )I + m k B], 

z=l 
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where is the probability of a transforming event at site i, and Pi is the 
transition matrix for site i given that a transforming event has occurred there. 
The form A[(l — mj.)I + m^B] is provided to show the relationship to Theorem 
where k may be any choice in {1, . . . , L}, with 

A = Ai <g> A 2 ® • • • ® A fc _i I*, ® A fc+ i <g> • • • <8) A L , (44) 



where Aj = (1 — mj)I, + TOjPj, and 



B = Ii (gi I 2 <g> • ■ • <g> I/c-i ® P/c ® I/c+i ® • • ■ <8> II- (45) 




However, both A and B in f|44[) and (|45|) are reducible due to the I terms, 
and this somewhat alters Theorem [TJs condition on D for strict monotonicity 
of spectral radius. This is seen in Theorem 1171 

Theorem 17. Theorem 2] Consider the stochastic matrix 

L 

M m := (g) [(1 - m ? )I s + m c P c ], (46) 

where each zs an x transition matrix for a reversible aperiodic Markov 
chain, the x identity matrix, L > 2, and m G (0,1/2) L . Let D 6e a 
positive N x N diagonal matrix, where N :— n C- 

Then for every point m 6 (0, 1/2) L , the spectral radius of 



M m D 



is non-increasing in each . 
If diagonal entries 

Da .,■ , D. ■> ■ (47) 

differ for at least one pair i^,i'^ £ {1, . . . , n^}, for some i\ G {1, . . . , ni}, . . ., 
G {1, . . . , ri£-i}, G {1, . . . , n^+i}, . . ., i L G {1, . . . , n L }, then 

Remark 18. The condition on the diagonal entries (|47| can be simply expressed 
in the cases £ = 1 and £ = L, respectively, as a requirement that D ^ cli (g> D' 
and D 7^ D' £g) cIl for any c G K and any D'. For £ G {2, . . . , L — 1} similar 
expressions could be given by employing permutations of the tensor indices. 

Theorem [17] was obtained to characterize the effect of mutation rates on a 
clonal population, or on a gene that modifies mutation rates in a non-recombining 
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genome. This theorem shows that the asymptotic growth rate of an infinite 
population of types {(ii, i 2 , . . . , is a strictly decreasing function of each 
mutation rate when the growth rates Z)j in (|T7|) differ, and non-increasing 
otherwise. All the eigenvalues of M m are positive, as in condition CI in Theo- 
rem[7l due to the assumption < 1/2 for £ e {1, . . . , L}. 

The asymptotic growth rate of a quasispecies |llj at a mutation-selection 
balance is thus shown by Theorem [17] to be a decreasing function of the mu- 
tation rate for each base pair, a result not previously obtained with this level 
of generality in the multilocus mutation parameters, mutation matrices, and 
multilocus selection coefficients. As a practical matter, however, in genetics L 
may be very large, for example L m 6 x 10 9 for the human genome. For such 
large L, populations cannot exhibit the Perron vector as a stationary distribu- 
tion since the population size is infinitesimal compared to the genome space of 
n = 4 L . However, in large populations models that examine a small-!/ portion 
or approximation of the full genome, the Perron vector may become relevant as 
the stationary distribution under selection and mutation. 

Proposition 19. Theorem \17\ extends to general symmetrizable irreducible non- 
negative matrices 

L 

M m = (g) [(1 - m^)r(A e )I e + m e A e ] , 

where each is a symmetrizable irreducible nonnegative x ri£ matrix. 
Proof. For any given A, let u^ T be its left Perron vector, and define 

P = KX) D - AD -- (48) 

Then P is a symmetrizable irreducible stochastic matrix: 

1. P > since r(A) > 0, and ua > 0. 

2. P is stochastic, since 

3. P is symmetrizable: 
P = ^rTD UA AD u ] = -^-D UA D L SD R D-] = D' L SD' R , 



r(A) UA r(A) 



where 



D ' L = kX) Du - Dl ' d 'r = d r d u1- 
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4. P is irreducible since [A'jy > if and only if 

[P%- =r(A)~* <x Ai [A*]y > 0. 

The spectral radius expression with A^ terms is now shown to be equivalent to 
one with P^ terms: 



r(M m D) = r ( (g) [(1 - m 4 )r(A ? )I ? + m ? A ? ] D 

L 



11^) 



?=1 
L 



?=1 
L 



(1 — m^)I^ + 



r(Ai 



D 



= f[ r ( A ?) r ((g) (du,, [(1 - mtte + m ? -^A s ]D-^ ] D 
f=i V=i "> k U J 



(1 — m^)Ij + 



D 



= n r ( A ? ) »•( ® K 1 - ^ + ™« p «] d 

L 

= H r ( A e) r ( M m D ) ; 
where := (g)^ =1 [(1 - m ? )I ? + rn^Pf]. Therefore 

L 

dm 



r(M m D) = n^)— « 



9m 



(49) 



Theorem [T71 being applicable to the right hand side of (|4"9")l . is thus extended to 
the left hand side involving general symmetric irreducible nonnegative matrices. 

□ 

Remark 20. The identification P = ^^yD U4 AD~^ (|48|) used in Proposition 
[19] also provides another route to extend Theorem [3] to Theorem sidestepping 
Lemmas H] and El Theorem [6j and the proof of Theorem [7] But analogous steps 
are the basis for the proof of Theorem [3] in [3] , so they are provided here to be 
self-contained. 



3.3 Other Applications 

Condition C2 is met by nonnegative conditionally negative definite matrices 
[3 Chapter 4], [7j. Symmetric conditionally negative definite matrices arise in 
the analysis of the one-locus, multiple-allele viability selection model. If the 
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matrix of fitness coefficients W allows the existence of a polymorphism with all 
alleles present, then the polymorphism is globally stable if W is conditionally 
negative definite [T7] (Kingman's exact condition being that they need only be 
conditionally negative semidcfinitc) . 

4 Open Problems 

The conditions in Theorem [7] that all the eigenvalues of A be positive (CI), or 
that all the non- Perron eigenvalues be negative (C2), are clearly very strong, 
and leave us with no results for intermediate conditions. One can no doubt 
obtain intermediate results by placing additional conditions on the matrices E, 
K, and D, but this remains an unexplored area. However, it is seen that for 
n = 2 the conditions CI and C2 cannot be weakened. 

The condition of symmetrizability also imposes a large constraint on the 
generality of the results here. The inequalities described undoubtedly have 
extensions to general nonnegative matrices. For the more general case, however, 
we lose use of the Rayleigh-Ritz variational formula, and the spectral radius no 
longer has the sum-of-squares representation (p~3|) . which is our principal tool. 
The general case remains an open question. 
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